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Abstract 

Motivated by questions in biological classification, we discuss some elementary combinatorial and 
computational properties of certain set systems that generalize hierarchies, namely, 'patchworks', 
'weak patchworks', 'ample patchworks' and 'saturated patchworks' and also outline how these concepts 
relate to an apparently new 'duality theory' for cluster systems that is based on the fundamental 
concept of 'compatibility' of clusters. 
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1. Introduction 

In various fields of classification, such as evolutionary biology, a nested hierarchy is regarded as the 
ideal object for describing the relationships between the objects under consideration (e.g. species). 
Such a hierarchy corresponds to the 'clades' of a rooted tree, and the notion in taxonomy traces back 
at least to Linnaeus [23], if not Aristotle [2], that is, it was considered years before the concept of an 
evolutionary phylogenetic tree was formed. 

But set systems that are more complex than hierarchies are also frequently relevant in this setting 
for two reasons: Firstly, even when the underlying structure can be assumed to be 'tree- like', the data 
itself may not be, and so the question of whether there are canonical ways to construct a hierarchy from 
an arbitrary set system (or other structure, such as a graph representing, for example, a genealogy, 
or a topological space) arises. This topic has been explored recently |T3], [E], and we briefly consider 
some further aspects of it in this note. 

Our main interest, however, lies in the second reason for dealing with more complex set systems 
than hierarchies; namely to accommodate settings where a tree does not provide an entirely accurate 
classification. In the case of evolutionary biology, for instance, processes such as lateral gene transfer, 
and the formation of hybrid species, give rise to non-treelike evolutionary histories (see, for example, 

[m, m, m)- 

Two distinct approaches have been developed for accommodating non-tree like processes. One 
approach attempts to use overtly graph-theoretic approaches to construct different types of directed 
acyclic graphs that could be appropriate for describing a 'network of life'. The other is to simply 
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consider set systems on taxa that can arise from such networks (e.g. via their so-called 'soft-wired' or 
'hard- wired' clusters [20j) or - more directly - from the complex pattern of the presence and absence 
of genetic markers across taxa. For this second approach, some relaxations of hierarchies have been 
developed, such as the notion of weak hierarchies, pyramids, or ^-compatible set systems (see, for 
example, [IS], [27], [28]). 

Here, we explore a different type of relaxation, viz. various types of 'patchworks' - a class of 
set systems that was introduced into phylogenetics in [?]. These set systems can be generated by 
iteratively enlarging a given 'generating set' (Lemma |7.1| below) and, when the resulting set system 
is sufficiently abundant, it will harbour at least one fully resolved or 'maximal' hierarchy (c/. The- 
orem [T]). Patchworks also turn up naturally in the context of a certain 'duality theory' for cluster 
systems and can be used to associate a canonical hierarchy with an arbitrary set system. Patchworks 
may thus provide a new tool for studying evolutionary relationships in a setting where hierarchies 
can be obscured by reticulate processes such as extensive lateral gene transfer, and for analysing col- 
lections of subsets of taxa created according to the presence /absence patterns of genetic loci thereby 
providing some insight into the extent to which a single tree (as opposed to a more complex network) 
may describe the evolution of the taxa [9] . 

The paper is organized as follows: We introduce some basic definitions and notation in the next 
section, and the main concepts we are going to investigate in Section [3| We collect some relevant 
examples in Section [4j and then present one of our two main results, viz. Theorem [l| in the next 
section. Then, we discuss a duality theory for cluster systems in Section [6] (containing the other main 
result - Theorem [2]- characterizing 'self-adjoint' cluster systems) as well as certain closure operators 
for cluster systems in Section [7j In Section [8j we illustrate these concepts using a biological data set, 
the so-called 'Belgian Transmission Chain' (of the human immunodeficiency virus (HIV), cf. [23]). 
We conclude the paper with a collection of comments and remarks regarding possible applications, 
extensions, and some open questions (Section [9|. 



2. Basic definitions and notation 

We begin with some terminology and definitions. Let X be a finite set of cardinality n at least 2 
that we fix once and for all. 

The partially ordered set consisting of all non-empty subsets of X (ordered by set inclusion 'C') 
whether empty or not is denoted by C{X), every subset C of C{X) is called a cluster system (for X), 
and the set of all cluster systems C Q C{X) is denoted by C^'^\X). Further, given any cluster system 
C G C(2)(X), we denote 

- by IJC := UceC ^ union and by P|C := PlceC ^ intersection of all clusters C £ C, 

- by Max(C) the set {C € C : {C € C : C C C'} = 0} of ah maximal clusters in C, 

- by Min(C) the set {C e C : {C G C : C" C C} = 0} of ah minimal clusters in C, 
and given - in addition - any cluster A G C{X), we denote 

- byC(C^):={CGC:CC A} the cluster system consisting of all sub-clusters of A in C, 

- by C{^A) := {C €z C : C ^ A} the cluster system consisting of all clusters in C containing A, 

- and we put Max(C:^) := Max(C(Cyl) - {A}). 

Two subsets A and B oi X are called compatible - which we denote by 'A||i?' - if and only if 
AnB € {0, A, B} holds; otherwise A and B are said to be incompatible - which we denote by '^jfi?'. 
Two cluster systems A,B €z C^'^\X) are called compatible if A\\B holds for every A G A and B €z B, 
which we denote by '^||;B'. And, given any cluster system C G C^'^\X), 

- we denote by Cextr the collection of those clusters A G C for which no clusters B,C £ C with 
A = B U C and B)^C exist, (i.e., the collection of clusters in C that are extremal relative to the 
'weak patchwork closure operation' to be introduced below), 
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- we denote by C* the largest cluster system in C^^^ (X) that is compatible with C, i.e., the collection 
of all non-empty subsets ^ of X that are compatible with every cluster C G C ~ this cluster 
system will also be called the adjoint of C, 

- and we denote the double adjoint (C*)* of C, i.e., the adjoint of the adjoint of C, simply by C** , 

- while the intersection C* n C** will be denoted by Co- 

Finally, the Hasse diagram He associated with a cluster system C G C^'^\X) is the directed graph 
with vertex set C and arc set 

Ec := {{C2, Ci) E : |{C G C : C72 C C C Ci}| = 2}. 

3. Patchvi^orks 

In many applications, it is useful to consider families of cluster systems that satisfy additional 
constraints. Hierarchies, studied in phylogenetics as well as in combinatorial optimization (where 
they are dubbed laminar families [22]), form a particularly popular example. Formally, a cluster 
system C E C^'^\X) is a called a (generalized) hierarchy on X - or, for short, an X -hierarchy or 
even just a hierarchy - if any two clusters A,B € C are compatible or, equivalently, if C C C* holds. 
Further, a cluster system C E C^'^\X) is called ample if Ci — C2 E C holds for every arc (C2, Ci) E Ec- 

Motivated by their applications in phylogenetics [HI [28] , patchworks were introduced in [6l E] as 
cluster systems C E C^'^\X) for which the following holds: 

(1) A,B eC and A)^B ^ An B,AU B eC. 

Here, we will also be interested in related notions: A cluster system C E C^'^\X) shall be called a 
weak patchwork if it only satisfies the second half of the patchwork condition ([T]): 

(2) A,B £C and A)^B ^ A\J B gC. 

And it will be called a saturated patchwork if, given any two clusters A,B G C with ^-[fS, the five 
clusters AnB,ADB,A — B,B — A and AU B — A n B are also contained in C. In particular, any 
saturated patchwork is a patchwork, and hence is also a weak patchwork. It is also obvious that a 
cluster system C E C^'^\X) is a saturated patchwork if and only if, given any three disjoint clusters 
Co, Ci, C2 E C{X) with Co U Ci, Co U C2 E C, one also has Co, Ci, C2, Ci U C2, Co U Ci U C2 E C. 

The following simple facts regarding such cluster systems are easy to check and/or well-known: 

(FO) Every hierarchy is a saturated patchwork and, so, in particular, is a patchwork, and every 
patchwork is a weak patchwork, while the converse does not hold unless one has n = 2. 

(Fl) The map C^'^\X) C^^\X) : C ^ C* defines a Galois connectioi^ on C^'^\X), i.e., one has 
Ac A** as well as 

''A<^B^B* QA* ^ A** c B**'' 
for all cluster systems A,B G C^'^\X) and, thus, also 

as wen as (C**)* = (C*)** = C* for ah C E C(2)(X). 
(F2) One also has 

^ ^ c ^ e** c X ^ B**\\A ^ x*\\B**'' 

as well as (^U^)* = A* nB* and [AnB)* C ^* uS* for all cluster systems A,B d C^'^\X) (but 
not necessarily ''A\B A*\\B* " as the example X := {1, 2, 3} and A := B := {{1}} shows). 



'^see http : / /en . wikipedia . org/wiki/ Galois_coimection 
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(F3) The adjoint C{X)* of C{X) is the cluster system Ctriv{X) := {C ^ X : \C\ = 1 or C = X} 
consisting of ah trivial clusters in C{X). It is a hierarchy and is contained in C* for every 
cluster system C G C^'^\X), and its union with any other hierarchy, weak patchwork, patchwork, 
or saturated patchwork C C C{X) is also a hierarchy, a weak patchwork, a patchwork, or a 
saturated patchwork, respectively. In particular, one has (C U CtriviX)) = C* and, hence, also 
(C U CtriviX))** = C** for every cluster system C G C^^\X). 

More generally, given any cluster system C G C^'^\X) for which there exists a partition 11 of a 
subset A oi X such that C coincides with (11) := {IJII' : / 11' C 11} (i.e., the disjoint union 
CU{0, A} is the 'ring of sets' generated by 11, cf. [1]), one has C* nC = 11 - while C* -C consists 
of all proper non-empty subsets of the sets in the partition 11 and the subsets of X that either 
contain or are disjoint from A. 

(F4) The cardinality {Til of any X-hierarchy Ti never exceeds 2n — 1, and it coincides with this number 
if and only if is a maximal X-hierarchy if and only if T-L = %* holds. 

(F5) In particular, the following three assertions hold: 

(F5-i) The adjoint C* of any cluster system C C C{X) that contains a maximal X-hierarchy T-L 
must be a hierarchy as it is necessarily contained in the adjoint %* of % which, however, 
coincides with % itself. 

(F5-ii) Conversely, the adjoint %* of any hierarchy % C C{X) contains a maximal X-hierarchy as 
it is actually the union of all X-hierarchies and, hence, also of all maximal X-hierarchies 
that contain %. 

(F5-iii) So, a cluster system C C C{X) is a hierarchy if and only if its adjoint contains a maximal 
X-hierarchy, in which case its double adjoint is also a hierarchy. 

(F6) Also, an X-hierarchy C is a maximal X-hierarchy if and only if X G C holds and either {i) every 
cluster C G C whose cardinality \C\ exceeds 1 is the union of two proper (and, hence, necessarily 
disjoint) sub-clusters Ci, C2 G C or, equivalently, {ii) C is ample and {x} G C holds for all x G X 
(cf. [5] for generalizations regarding maximal X-hierarchies for infinite sets X). 

Recall also (from, for example, [18]) that a cluster system C G C^^-*(X) with X = (JC is a hierarchy 
if and only if C{x) := C\C{D {x}) G C holds for every x G X and the Hasse diagram He is a rooted 
X—/oresQ relative to the labelling map (/)c : X — )• C : x 1— )• C{x) where the roots (respectively leaves) 
are formed by the maximal (respectively minimal) clusters in C ~ see, for example, |13l EH] for more 
details. In particular, given any hierarchy C G C^'^\X) with X = [jC, the following holds: 

(HI) the associated rooted X— forest He is a rooted X— tree (i.e., it is connected as a graph) if and 
only if X G C holds. 

(H2) (pc maps X bijectively onto the set of leaves of He if and only if one has {x} G C for all x G X. 
(H3) He is a rooted binary X-tree with leaf set {{x} : x G X} if and only if C is a maximal X- 
hierarchy (see, for example, [l3l [28] for details). 

Furthermore, associating to every X-hierarchy with X = |J C the corresponding rooted X— forest 
sets up a canonical one-to-one correspondence between such hierarchies and isomorphism classes of 
rooted X— forests, i.e., every rooted X— forest is isomorphic to the rooted X— forest associated to one 
and only one such hierarchy. These concepts are illustrated in Fig. [T] where, for simplicity (here and 
elsewhere), we write a set {xi, . . . , xt} as xi . . . xt when no confusion can occur. 

4. Examples of patchworks 

In this section, we collect some examples of patchworks that arise naturally in the study of cluster 
systems, and present some characterizations of hierarchies via patchworks and adjoint cluster systems. 



^ A rooted X— forest T — (V, E) is a collection of rooted trees together with a function (j> — 4>t ■ X ^ V providing a 
partial labelling of vertices of T such that every unlabelled vertex has degree at least 3. 
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(a) (b) (c) 

Figure 1: The Hasse diagrams of (a) the hierarchy {1, 2, 3, 4, 12, 34, 1234}; (b) the patchwork {1, 2, 3, 4, 23, 123, 234, 1234}; 
and (c) the weak patchwork {1, 2, 3, 4, 123, 234, 1234}. 

(El) Let C be a cluster system for which | |JC| — \^'\ +P holds for some fixed integer p for all non- 
empty subsets C of C. Then the collection of all non-empty subsets C of C with | |J ^'1 = |C| +p 
forms a patchwork since 

I IJ C I + lU c" I = I U C n U c" I + I U C u U c" I > I |J(C' n c") I + 1 |J(C' u c") | 

and |C'| -h |C"| = |C' n C"| |C' U C"\ holds for any two subsets C , C" of C{X) (see also Lemma 
1.2 in I.16j). In phylogenetic combinatorics, this observation is central to supertree construction 
from triplet and quartet trees (with p = 2 and p = 3, respectively; see [3 HSl ESI EH] ) • Moreover, 
the case p = has an interesting combinatorial implication for any cluster system C for X that 
has a 'system of distinct representatives' [19j: If C is such a cluster system, then the 'trivial' 
direction of Hall's classic result implies that the collection of non-empty subsets C of C that 
satisfy | U^'l — l^'l forms a patchwork (see [12] for further related results). 
(E2) Let G = {X, E) be a finite graph with vertex set X and let Cg denote the set of subsets of X 
that are connected relative to G. Then, Cg is a weak patchwork. Moreover, if G is acyclic (i.e. a 
forest of trees), then Cg is a patchwork. More generally, the following can easily be established: 

Lemma 4.1. For any finite graph G with vertex set X, Cg is a patchwork if and only if G is a 
block graph, i.e., a graph in which every 2-connected component or 'block' is a clique. 

(E3) Further examples of saturated patchworks are provided by: 

Proposition 4.2. The adjoint C* of any cluster system C G C^'^\X) is a saturated patchwork 
that contains Ctriv 

Proof: The fact that C* contains Ctriv is obvious - and follows directly from the observations 
(Fl) and (F3). The fact that C* is saturated follows immediately from the following lemma. 

Lemma 4.3. A cluster C in C{X) is compatible with two given clusters A,B G C{X) that are 
not compatible with each other if and only if either one of the following five assertions holds: 

(i) Cn{AuB) = $, (fi) AUB<ZG, (iii) C <^AnB, (iv) C <^ A - B, or (v) C Q B - A. 

In particular, if C is compatible with two clusters A,B £ C{X) that are not compatible with each 
other, then the five clusters AL)B,Ar]B,A — B,B — A, and {A — B) U {B — A) must also be 
compatible with C. 

Proof: If C is compatible with two clusters A,B£ C{X) that are not compatible with each 
other and (i) does not hold, we may assume wlog that C D A ^ (}> and, therefore, either C O A 
or A Q C holds. However, if A C C holds, we must also have C H i? 7^ and, therefore, also 
B (1 C and, hence, AVJ B (1 C ~ i.e. (ii) - holds as C C i? cannot hold in view oi A'^ B. 
Otherwise, we have C A and, therefore, B <^ C. So, either CQB or CnB = f/> must hold 
implying that either C ACi B - i.e. (iii) -oiC'^A — B - i.e. (iii) - holds. 
Conversely, it is obvious that each of these five assertions implies that not only A and B, but 
also the five clusters AUB, AnB, A-B, B-A, and {A-B)U{B-A) must be compatible with C. I 
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(E4) Finally, given any cluster system C G C^'^\X), recall that it was noted in |21|, I29| that the 'ST- 
sets' with respect to C, i.e., the clusters A in its adjoint C* with C\\C' for all subsets C, C of A in 
C, form a weak patchwork. Remarkably, this can now be improved in several ways: Consider an 
arbitrary subset IZ of the set x C(X) and define, given any such subset TZ and any cluster 

system C C C(X), five subsets of C* denoted C*(7^, sub), C*(7^, psub),C*(7^, sup),C*(7^, psup), 
and C*(7^, disj) as follows: 



C*(7^,sub) 


:= {AeC* 


{C, C')en for all C,C' eC with C, C C 




C*(7^,psub) 


:= {AeC* 


(C,C") 


G 7^ for all C,C' &C with C7, C' C 




C*(7^,sup) 


:= {AeC* 


(C,C") 


G 7^ for all C, C" e C with C7, C' D 




C*{TZ, psup) 


:= {AeC* 


(C,C") 


G 7^ for all C, C" e C with C7, C' 2 


A}, and 


c*(7^,disj) 


:= {AeC* 


(C,C") 


G 7e for all C, C e C with (7 n A = 


= C n A = 0} 



Note that 

C*(7^,sub) C r(7^,psub) and C*(7^,sup) C C*(7^,psup) 

always holds, and that C*(7^, sub) and C*(7^,psub) contain - essentially by definition - all 
clusters in C* that are proper sub-clusters of clusters in C*(7^, psub), i.e., 

(3) AeC*,A' e C*(7l,psub),.4 C A' ^ Ae C*(7^,sub) C C*(7l,psub) 

holds. Similarly, C*{Tl, sup) and C*{Tl, psup) contain - also by definition - all clusters in C* that 
properly contain a cluster in C*(7^, psup), i.e., 

(4) A€C*,A' GC*{n, psup) ,A^ A' ^ AeC*{TZ, sup) C C* (7^, psup) 
also holds. And so does 

(5) AeC*,A' e C*{n, disj), A ^ A' ^ A e C*{n, disj). 



Furthermore, given any TZ as above, let T^disj denote the union of TZ and the set of all pairs 
(C, C) e C{X) X C{X) with C n C = 0, let denote the - generally larger - union of TZ and 
the set of aU pairs (C, C") G C{X) x C{X) with C\\C', and note 

||:1) that C*(7^|,sub) = C*(7^|,psub) and C*(7^|,sup) = C*(7^|,psup) always holds as C = A 
imphes C\\C' for ah clusters C G C{X) with either C C A or C" D A, 

II :2) and that also ('7^|)disj = "^i and, hence, also C*(7^|,sub) = C*(7^|,psub) = C* (T^'^^gj , psub) 
and C*(7^|,sup) = C*(7^|,psup) = C*(7^[jjgj, psup) holds for TZ' := 7^|. 

Clearly, the ST-sets from [211 [29] exactly the sets C*(0||,sub). Thus, the observation from 
^ is implied by 



Lemma 4.4. Given any subset TZ of the set C{X) x C{X), the following holds: 

(i) The cluster systems C* {TZ\\, sub) = C*(7^| , psub), C* (T^disj > psub) , C*(7^|,sup) = C*(7^| , psup), 
and C*(7^disj) psup) form saturated patchworks. 

(ii) The two cluster systems C*{TZ^\,disi) and C*(7^disj, disj) form patchworks. 

Proof: (i): In view of (||:1) and (||:2), it suffices to show that the two cluster systems C* (T^disj i psub) 
and C* (T^disj , psup) form saturated patchworks for any subset TZ of the set C{X)xC{X) (including 
those subsets TZ' of C(X) x C{X) that are of the form TZ' = TZ^\ for some subset TZ oiC{X) xC{X)) . 



To do so, recall that, by Lemma [43} either C D {AU B) = oi AU B C C, oi C (1 An B, or 
C'^A — B, or C'^B — A holds for any cluster C in C and any two clusters A, B in C* that 
are not compatible with each other. Thus, if C is also a proper subset of ^ U -B, it must be 
contained in exactly one of the three pairwise disjoint subsets ACi B, A — B, oi B — A. And if 
C properly contains either ACi B, A — B, or B — A, it must contain AU B. 
In consequence. A, B £ C* (T^disj , psub) , A j/[B, C, C G C, and C,C' C AU B implies that either 
C, C C A or C, C" C 5 or C n C = and, hence in any case, (C, C) G 7^disj holds. 
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And A,B £ C*(7^disj,psup), C,C' £ C, A )^ B , and C,C' ^ A n B or C,C' ^ A - B or 
C,C' ^ B — A implies that C,C' 5 AU B and, therefore, also C,C' ^ A and, hence, again 
(C, C) G 7^disj holds. 

So, ^, i? G C* (T^disj ) psub) and A j/[ B implies AU B £ C*(7^disj5 psub) and, therefore, also 
AnB,A- B,B - A, {A- B)LI{B - A) £ C*(7^disj, psub) in view of ^. 

And A,B £ C*(7^disj, psup) and A ^ B implies An B,A - B,B - A, {A - B) U {B - A) £ 
C*(7^disji psup) while AU B £ C*(7^disji psup) holds anyway in view of Q. 
In consequence, the cluster systems C*(7^disj) psub) and C*(7^disj) psup) and, hence, in particular 
also the cluster systems C*(7^|,sub) = C*(7^|,psub) and C*(7^|,sup) = C*(7?j| , psup) all are 
saturated patchworks for every subset TZ of C{X) x C{X). 



(ii): Finally, Lemma [43| implies also that C n {AU B) = ^ or C Q A - B or C Q B - A holds 
for any cluster C in C and any two clusters A, B in C* that are not compatible with each other 
and for which Cn{AnB) = holds. 

Thus, if C, C are two clusters with C D {An B) = C D {An B) = we have either C n C" = 
OTCnA = C'nA = (/}oTCnB = C'nB = (/} and, therefore, in any case (C, C") £ 7^disj• 
So, A,B £ C*(7^disj, disj) and A 1/\B implies AnB £ C*(7^disj, disj) while AU B £ C*(7^disj, disj) 
holds anyway in view of (§. 

In consequence, the cluster systems C*(7^disj) disj) and C*(7^|,disj) form patchworks for every 
subset n of C{X) xC{X). I 

Remark We leave it to the interested reader to construct an example of a cluster system C C C{X) 
for which C*(0||,disj) does not form a saturated patchwork. 

5. Ample patchworks 

While it is obvious that any cluster system C £ C^'^\X) that contains, for every arc (C2, Ci) £ Eq, 
a maximal X-hierarchy % with Ci,C2 £ % must be ample, it was observed in [B] that - somehow 
conversely - a patchwork that contains Ctriv is ample if and only if it contains a maximal X-hierarchy, 
and in [8j that a weak patchwork C £ C^'^\X) that contains CtHv is ample if and only if the following 
apparently stronger assertion holds: every hierarchy T-L can be extended within C to a maximal X- 
hierarchy, i.e., to a maximal X-hierarchy %' ^H. with %' C C. In particular, any such patchwork must 



contain some maximal X-hierarchy. Combining these facts with Proposition 4.2 and our observations 
collected in (F5), we obtain: 

Theorem 1. A cluster system C £ C^'^\X) is a hierarchy if and only if the adjoint C* of C is ample 
and - hence - an ample saturated patchwork that contains Ctriv Conversely, the adjoint C* of any 
ample weak patchwork C £ C^^\X) that contains Ctriv is a hierarchy, implying that its double adjoint 
C** must also be ample and - hence, just as above - an ample saturated patchwork that contains Ctriv 

Proof: Indeed, if C is a hierarchy, its adjoint C* contains a maximal hierarchy in view of (F5-ii) and it is 



a patchwork by Proposition 4.2, It must, therefore, be ample in view of the "if" direction of the results 
obtained in |6]. Conversely, if C* is ample, it must - in view of the fact that it is a patchwork that 
contains Ctriv and the "only if" direction - contain a maximal hierarchy. So, its adjoint (C*)* = C** 
and, hence, also C itself must be a hierarchy in view of (F5-i). 

The remaining claim regarding the adjoints C* of ample weak patchworks follows from the fact 
that every ample weak patchwork C £ C^'^\X) that contains Ctriv rnust, by [8|, contain a maximal 
hierarchy. So, its adjoint C* must be a hierarchy, again by (F5-i), and its double adjoint, therefore, is 
an ample saturated patchwork that contains Ctriv ■ 

Theorem [1] supplies the only equivalence that is not perfectly trivial in the following ten equivalent 
characterizations of hierarchies: 

Proposition 5.1. Given any cluster system C £ C^'^\X), the following ten assertions are equivalent: 
(i) C is a hierarchy, (ii) C is contained in C* , (ill) C is contained in Co, (iv) every subsystem C of C 
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is a saturated patchwork, (v) every subsystem C of C is a patchwork, (vi) every subsystem C of C is 
a weak patchwork, (vii) C is a weak patchwork and coincides with Cextr, (viii) Cextr is contained in 
C* , (ix) C* contains a maximal X -hierarchy, (x) C* is ample. 

Proof: It is obvious that the imphcations "(i) <;=^ (ii) <^=^ (iii)" '^^^^ ^ "(i)=^(iv)=^(v)=^(vi)" 
hold and that, in turn, (vi) implies that C can not contain any pair A, B of incompatible clusters. So, 
also "(vi)=^>(i)" holds. Further, (i) implies (vii) as (i) implies (vi) and C = Cextr-, and (vii) implies 
(i) as the existence of a pair A, B of incompatible clusters in a weak patchwork C would imply that 
their union A[J B would be contained in C, but not in Cextr- 

It is also obvious that (ii) implies (viii) while, conversely, "(viii) ^ (i)" holds because if C were 
not a hierarchy while (viii) holds for C, we could choose a minimal pair A,B G C with A)^B, (i.e., a 
pair A,B eC so that A', B' G C, A' C A, B' C B, and A')^B' imphes A = A' and B = B') while, in 
view of (viii), we must also have A Cextr ^ that is, A = j4iU A2 for some Ai, A2 G C with Ai)^A2- By 



the minimality assumption, this would implyAi||i? and A2||i? and, hence, A\\B in view of Lemma 4.3 
a contradiction. 

Finally, the equivalence of (i) and (ix) has been noted already in (F5-iii), and that of (ix) and 
(x) in Theorem [l| I 

Remark Although the double adjoint C** of any ample cluster system C G C^'^^X) that contains Ctriv 
must also be an ample cluster system that contains Ctriv, such a cluster system C does not need to 
coincide with its double adjoint C** as the example X := {1, 2, 3} and C := {1, 2, 3, 12, 23, 123} clearly 
shows. However, as we shall see in the next section, C = C** will hold in case C is also saturated. 
Indeed, we will show there that an arbitrary cluster system C E C^'^\X) coincides with its double 
adjoint if and only if it is a saturated patchwork that contains Ctriv 



6. A duality theory for cluster systems 

We have noted above that, given a cluster system C G C^'^\X), the adjoint C* is a saturated 
patchwork that contains Ctrivi^). Here, we will show that, conversely, a cluster system C £ C^'^\X) 
is of the form C = A* for some cluster system A G C^'^\X) whenever - and, therefore, if and only if 
- it is saturated and contains Ctriv{^)- To this end, we note first: 

Lemma 6.1. Given a weak patchwork C € C^'^\X) and any cluster A in C with at least two distinct 
maximal proper sub-clusters in C - i.e., with |Max(C : ^)| > 2 - either one of the following two 
mutually exclusive assertions holds: 

[Ml-A] C(C A)||Max(C : A) holds ~ in particular, any two distinct clusters in Max(C:^) are disjoint. 

\M2-A] Cr\C' and C U C = A holds for any two distinct clusters C, C in Max(C :^). 

Furthermore, if C is saturated, then \M1-A\ holds for some cluster A inC with |Max(C:74)| > 2 if and 
only if one has Max(C :A) C C* in which case the cluster system 

(Max(C -.A)) ■.= {[jM:^^M(Z Max(C : A)} 

must also be contained in C* . 

Proof: Note first that, if C, C are two distinct clusters in Max(C:^) with C H C" ^ 0, we must have 
C U C = ^ as weh as C n C" / for every other cluster C" in Max(C : A) := Max(C : A). Indeed, 
C / C and C n C" / implies C U C e C and C,C' <^ C U C <Z A and hence C U C = A, as 
claimed. And if C" is yet another cluster in Max(C -.A), we must have C" fl C" C C" = C" D A = 
C" n (C U C") = {C" n C) U {C" n C), and hence C" D C ^ 0, also as claimed. 

Furthermore, if [Ml-A] holds, C is any cluster in Max(C:^), and B is any cluster in C(CA), then 
we must have either BCCorBriC = 9,asB<^C and i? n C / would imply that B' ^ C and 
5' n C / would hold for any maximal cluster B' G Max(C : A) with B C B'. So, [Ml-^] cannot hold 
in this case. This clearly implies the first claim. 
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To establish the second claim, assume that C e C(2)(X) is a saturated patchwork, that A is a 
cluster in C with |Max(C:^)| > 2 for which [Ml-yl] holds, and that is a non-empty proper subset 
of Max(C : A). Then, we must also have M := |J € C* because, if B is any cluster in C, we must 
have 5 II M in case B\\A, &s A (1 B implies M (1 B, Ar\ B = % implies M n 5 = 0, and B (1 A 
implies i?CMorMni? = 0, depending on whether the unique cluster C G Max(C: A) with B C. C 
is contained in j\4 or not. And if i? A holds, BCi A and A — B must be disjoint proper subsets of A 
in C for which the union is A. So, if [Ml-^] holds, we must have Max(C : A) = {B Ci A,A — B} and, 
therefore, either M = {An B} and M = An B, oi M = {A — B} and M = A — B, which imphes in 
both cases that i?||M holds. I 

We are now ready to establish the following theorem: 

Theorem 2. A cluster system C G C^'^\X) coincides with C** or - equivalently, cf. (Fl)- is of the 
form C = A* for some cluster system A € C^'^\X) if and only if it is a saturated patchwork that 
contains Ctriv 

In particular, %** = "H U Ctriv{X) holds for every hierarchy % C C{X) and, more generally, 
C** = CL)Ctriv{X) holds for every saturated patchwork C Q C{X), while %** = % holds for a hierarchy 
% C C{X) if and only ifH contains CtriviX). 

Proof: It suffices to show that if C is saturated and contains Ctriv{X), there exist no clusters in C** — C. 
Otherwise, assume that S is a cluster in C** — C of minimal cardinality. In view of X £ C, there 
exists a unique smallest cluster A in C with B <Z A, viz. A := f^C{^ B). If there were two clusters 
Ci, C2 e Max(C : A) with Co := Ci n C2 / and hence Ci U C2 = A and C3 := A - Cq e C, we must 
have B ^ Ci,C2,C3 and, therefore, \B n Ci|, \B n C2I < |-B| as well as i? n Co 7^ and, therefore, 
S n Ci, 5 n C2 / 0. Thus, B e C** and Ci, C2 G C C C** implies B n Ci, B n C2 e C** in view of 



Proposition 4.2 So, our choice of B implies that B n Ci, B n C2 S C must also hold and, therefore, 

B = BnA = Bn{CiUC2) = {BnCi)u{BnC2) ec in view of {B n Ci) n n C2) = 5 n Co 0, 

which is a contradiction. 



So, by Lemma 6.1, [Ml-^] must hold and, therefore, also \JM £ C* for any proper non-empty 
subset Ai of Max(C : A) <Z C while, in view of our assumption Ctrm ^ C, the subsets in Max(C : A) 
must actually form a partition of A. Thus, any cluster B G C** — C with B C. A = f^C{'D B) must 
be the union of all those subsets C in Max(C : A) with which it has a non-empty intersection (as 
C G Max(C : A) and, therefore, C eC*, B eC** and 5 n C 7^ imply C C B because B CC cannot 
hold by our choice of A). That is, we must have B G (Max(C:A)) C C* and, hence, 

B £ (Max(C : A)) n C** C (Max(C : A)) n (Max(C : A))* = Max(C : A) C C 

in view of (Fl) and (F3), in contradiction to our assumption C ^ C. The last remarks follow directly 
from (FO) and (F3). I 

Combining Theorem [2] with our observations in Theorem [T| we can now give a detailed description 
of the structure of all ample and saturated patchworks that contain Ctriv- 

Proposition 6.2. A cluster system C G C^'^\X) is an ample and saturated patchwork that contains 
Ctriv if CLnd only if there exists a {necessarily unique) hierarchy Ti G C^'^\X) containing Ctriv (viz., the 
hierarchy % := C*) such that C is the disjoint union of the set {X} and the sets {M.ax{'H : A)) where 
A ranges over all clusters in % of cardinality at least 2. In particular: 

\C\ = l+ Yj (2l^^"(^^^)l - 2) 
Aec*, |A|>1 

must hold in this case. 

Proof: We have already noted in Theorem [T] that the adjoint C* of any ample weak patchwork 
C G C(2)(^) that contains 

Ctriv must be a hierarchy that contains Ctriv So, writing Ti for the 
hierarchy C* , we must have C = %* by Theorem [2] in case C G C^'^\X) is an ample and saturated 
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patchwork that contains Ctriv It is also obvious that Lemma 6.1 imphes that (Max(7^ : A)) C T-l* = C 
holds for every cluster A £ 71 with |^| > 1 and that the various subsets of C of the form (Max(?^ : A)) 
for some such A G Ti must be disjoint for distinct clusters A as, given some non-empty and proper 
subset Ai of Ma.x('H : A), A must be the unique smallest cluster in T-L with [JA4 A. Further, 
|(Max(^ : A))\ = 2^^^^^'^--^)\ - 2 clearly holds for every AeTi with \A\ > 1. 

Thus, it suffices to note that, given any cluster C G '?^*with C ^ X, one has C G {Max{'H : A)) 
for that unique smallest cluster A £ T-L with CCA, viz. the intersection of all cluster A' £ H with 
CCA'. However: 

- C must be compatible with all clusters C £ Max('H : A) C C; 

- by construction, C is properly contained in A but not in any cluster C' £ Max{T-L : A); and 

- Max(T-L : A) must be a partition of A in view of our assumption Ctriv{X) C "H. 

So, we have C Q C for every cluster C £ Max('H : A) with C n C 7^ and, therefore, - in 
view ofC = CnA = Cn [jMax{n : A) = Uc'eMax(-H:A) C D C - also C = [jM{C) for 
M{C) := {C £ Max{n : A) : C7 n C" / 0} which, in turn, implies / M{C) C Max{n : A) 
and, therefore, also C £ (Max(T-L : A)), as claimed. I 



7. Generators for patchvi^orks 

Obviously, the intersection Ci H C2 of any two patchworks Ci and C2 is a patchwork, too. And the 
same holds for weak or saturated patchworks. In addition, the cluster system C{X) is a saturated 
patchwork and hence is also a (weak) patchwork. Therefore, we can associate, to any cluster system 
C £ C'-^^(X), the following patchworks "generated by C": (i) The unique minimal weak patchwork 
[C]iofc) (ii) the ~ generally larger - unique minimal patchwork [C]pt, and (iii) the - generally still 
larger - unique minimal saturated patchwork [C]st that, respectively, contain C. These patchworks are 
also called the weak patchwork closure, the patchwork closure, and the saturated patchwork closure of 
C. Clearly, the corresponding three operators P^k, Ppt, Pst- C^'^\X) — )• C^'^\X) that map any cluster 
system C £ C^'^\X) onto its weak patchwork closure [C]wk, its patchwork closure [C]pt and its saturated 
patchwork closure [C]st, respectively, are 'closure operators' on C^'^\X). 

Further, we have C Q [C]^k Q [C]pt Q [C]st Q C** for every C £ C^^\X). So, A,B £ C^^\X) and 
A\\13 imply not only A**\\13** as observed in (F2), but also [AUk\\[l3Uk, [A]pt\\[13]pt and [^],t||[i3],t. 

Note further that [CuCtriv]wk = [C]wk ^ Ctriv, {Cl->Ctriv]pt = [C]pt^ Ctriv, and [CuCtrivjst = 
[C]st^Ctriv must hold for every cluster system C £ C^'^^X) in view of (F3) as well as C** = [C]st^Ctriv- 



Indeed, [C]st Q C** and, therefore, also [C]st U Ctriv Q C** must hold in view of Proposition 4.2 while 
C** C {[C]stUCtriv)** must - in view of C C [CUuCtr^v - hold by (Fl), and {[C]stUCtr^v)** = [CUuCtriv 
must hold by Theorem [2| as [C]st ^ Ctriv = [C LlCtrivjst is a saturated patchwork that contains Ctriv 

It is also worth noting that our results imply that the saturated closure [C]st of an ample weak 
patchwork C £ C^'^\X) that contains Ctriv is also ample, as [C]st = C** must hold and C** must be 
ample by Theorem [l] as this theorem implies that C* that must be a hierarchy which, in turn, implies 
that C** = (C*)* must be ample. 

Next, given any cluster system C £ C^'^\X) and any k £ Nq, we define: 

(i) its weak k-extension, denoted ext^"''^^(C), by setting extQ'"'^^(C) := C and letting ext^'^'^^-'(C) denote 
the union of ext^"''^''(C) and the collection of all clusters A £ C{X) that are the union of any two 
two incompatible clusters in ext^'"'^^(C); 

(ii) its k-extension, denoted ext^^^\c), by setting ext[/'*^(C) := C, and letting ext^^^^(C) denote the 
collection of all clusters C £ C{X) that are the union or intersection of any two incompatible 
clusters in ext^"''^''(C); and 
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(iii) its saturated k-extension, denoted ext^* (C), by setting extg '{C) := C and letting ext^'^^',^ (C) 
denote the union of extjJ'*^(C) and the collection of all clusters C G C{X) for which two incom- 
patible clusters A,B & exti.**^(C) with C & {Al} B, Ar\ B, A - B,B - A,{A - B) 1} {B - A)} 
exist. 

It is also obvious that, for "yy":="u;A;", "pf or "sf , we have: 

ext(^^^)(C) C eid!^^ll{C) = exti^^)(exti^^)(C)). 
As one should expect, ^-extensions can be used to construct patchwork closures explicitly: 



Lemma 7.1. Given any cluster system C € C^'^\X), one has: 

\Je^t'i^''\c) = [CU, Uextf)(C) = [CW, and (J extf ^(C) = [C],*. 

k k k 

Proof: With 'yy":=''wk", "pt" or "st" as above, we clearly have C C extS^^^(C) C [C]yy and, therefore, 
also [C]yy C [ext^^^^(C)]j^y C [[CJyylyj/ = [C]yy, I.e., [C]yy = [cxt ^^^^ (C)] . lu conscquence, we also have 
[exti^^^;(C)],, = [extS^^)(ext(^^)(C))],, = [exti^^)(C)],, and hence ext(^^)(C) C [extl^^)(C)],, = [C]yy 
for all fc G No. 

It remains to note that also [C]yy C ext^^^^ (C) holds, which will follow from noting that 
ext^^^"* (C) is a weak patchwork for ''yy" = "luA;" , a patchwork for "yy" = "pi" , and a saturated 
patchwork for "yy" = "sp". Indeed, given A,B ^ Ufc with Ajf(B, there exists a smallest 

natural number fco > with A,Be: extj,^^^(C). So, by construction, we have AU B E ext^^^^^(C) C 
\Jk^'^k"'\c) in case "yy" = "wk'\ we have AUB,AnB e ext^^*|i(C) C Ujk ext^^*^ (C) in case "yy" 
= "pt" , and we have ,AUB,AnB,A-B,B-A,AUB-AnBe e^it'j^^l^ (C) C extj.^*^ (C) in case 
"yy" = "si" , as claimed. I 

For example, for X := {1,2,3} and C := {12,23}, we have = exti"'''^(C) = {12,23,123}, 

[C]pt = e^t'f^\c) = {2, 12, 23, 123} and [C]st = extS^*^(C) = {1, 2, 3, 12, 13, 23, 123}. 

Remarkably, the clusters in the weak patchwork generated by a cluster system can also be char- 
acterized as follows: Given any cluster system C G C^'^\X) and a subset A of X, the incidence graph 
T{A;C) := (^A, E(A;C)^ of A relative to C to be the simple graph with vertex set A and edge set 
E{A;C) := {{x,y} C : there exists some B with {x,y} C 5 C ^}. Then, we have 

Proposition 7.2. Given any cluster system C G C^'^\X) and any cluster A G C{X) with \A\ > 2, the 
following four assertions are all equivalent: 

(i) A G [C]y,k; 

(ii) T{A;C) is connected; 

(iii) there exists some k < \A\ and clusters Ai,A2, . . . , ^ C with 

A = AiU---UAk and (Ai U • • • U Aj-i)j^Aj for allj = 2,..., k; 

(iv) AGextS(C). 

Proof: (i) ^ (ii): This follows from the fact that the cluster system {C G C{X) : V{A\ C) is connected} 
contains C and is a weak patchwork as A, A' G C{X) implies that - in view of E{A]C) U E{A';C) C 
E{A U A'; C) - the graph r(^ [J A' -C) = {AVJ A\ E{A U A';C)) is connected provided the two graphs 
V{A;C) and T{A';C) are connected and ^ n ^' 7^ holds. 

(ii) => (iii): If V{A;C) is connected, we may choose an arbitrary edge {xi,yi} G E{A;C) and 
find at least some cluster ^1 C A in C with {xi,y\} C A^. And if A' is any subset of A for which 
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clusters ^'i, . . . yl'^, G C with ^' = yi; U • • • U A'^, and (^'^ U • • • U A'-_^))^A'- for all j = 2, . . . , fc' 
exist, our assumption that T{A]C) is connected implies in case A' A the existence of some edge 
{x,y} G E{A;C) with x G A' and y ^ A' and, therefore, some additional cluster ^^/_|_]^ C ^ in C with 
{x,y} C A'f^i_^_i and, hence, A'-H^'^/^j^^. Thus, A = AiU ■ ■ - U A^ must hold for any maximal sequence 
of clusters Ai, . . . , A^ Q A in C with (Ai U • • • U Aj-i))^Aj. Furthermore, we must have fc < 1^41 in this 
case, as (Ai U • • ■LlAj_i))^Aj implies that {AiU ■ ■ - U Aj^i\ < |^i U • • • U U for all j = 2, . . . ,k 
and, therefore, | A| = | Ai U • • • U Afc| > 1 + |^i U • • • U Ak-i\ > ■ ■ ■ > k - 1 + \Ai\ > k. 

Finally, the implications '(iii) =^ (iv) ^ (i)' are obvious. I 

It is also worth noting that, among all cluster systems that generate a given weak patchwork C, 
there exists a unique minimal one that we shall also call the base for C, viz., the cluster system Cextr- 

Proposition 7.3. Let C G C^'^\X) be a weak patchwork. Then, one has [Cj^k = C for some cluster 
system C (^C if and only if C contains Cextr ■ 

Proof: This is a direct consequence of the fact that the operator P^^ : C^'^\X) C^^\X) satisfies the 
so-called 'anti-exchange axiom' (see jUj), i.e., if C G C^'^\X) is a cluster system, Ci,C2 G C{X) are 
two distinct subsets of X, and neither Ci nor C2 belongs to [C]u,k, but Ci belongs to [C U {{C2}}]iufc 
then C2 does not belong to [C U {{Ci}}]^^,^ (as Ci G [C U {{C2}}]«)fc - [C]wk implies IC2I < |Ci|). I 

In this context, the following is also worth noting: 

(1) The analogue of Proposition |7.3| does not hold for (proper) patchworks. For example, we have 
[{12,14, 134}]pt = [{12, 124, 134}]pt = C for the patchwork C := {1,12,14,124,134,1234}, but 
[{12,134}]pi = {1,12,134,1234} ^ C holds for the intersection {12,134} of {12,14,134} and 
{12, 124, 134}. The example shows also that the closure operator Ppt does not satisfy the anti- 
exchange axiom in view of 14, 124 [{12, 134}]pt = {1, 12, 134, 1234}, but 124 G [{12, 134, 14}]pt 
and 14 G [{12, 134, 124}]^*. 

Remarkably, it is even possible for a patchwork C to have two disjoint non-empty subsets 0i, 02 
with [I3i]pt = [S2]pt = C, but [13]pt 7^ C for every proper subset B of either Bi or B2. Indeed, 
if C denotes the collection of all non-empty subsets of {1,2,3,4} that contain the element 1, 
then C forms a patchwork for which [Bi]pt = [B2]pt = C holds for Bi := {12,13,14} and 
B2 ■= {123, 124, 134}, but no proper subset of either Bi or B2 generates C. 

(2) Obviously, the base C := C{X)extr of C{X) consists of all clusters C C X of size 1 and 2. This 
shows that the size of [C]^fc can be exponential compared to that of C. 

(3) It is also possible for the base of a weak patchwork C G C^'^\X) to be exponential in the 
cardinality of X. For example, if n is even, let C be the set of all subsets of X of size at least 
|n. Then, the base Cextr of C coincides with the set of all subsets of X of size exactly |n, and 
its cardinality is exponential in n. 

8. An illustrative analysis of a biological data set 

To illustrate our results, we now present a simple application to a 'real-world' data set: the 
so-called 'Belgian Transmission Chain' of the human immunodeficiency virus studied by Lemey et 
al. in [23]. The original data set X := {^95, ^oo, ^90, ^96, ^^94, (^02, -Doi, -E'oi, -^02, G02, -ffgs, ^99} was 
downloaded from |25j . It contains 12 env-gp41 HIV sequences from nine patients. The letters indicate 
the nine distinct patients A,B, . . . ,1 and the indices give the date of isolation. After aligning the 
sequences, eliminating all sites that contained indels or nucleotides that could not clearly be identified 
as either purine (R) or pyrimidine (Y), and then rewriting the sequences simply as R, ^-sequences, we 
obtained an alignment of 12 sequences with altogether 902 sites of which 875 were constant while the 



12 



remaining 27 non-constant sites induced a split systen:|^ '^hiv containing seven distinct trivial splits 
of total multiplicity 14, four distinct 2-splits of total multiplicity 7, two distinct 3-splits of multiplicity 
1 each, one 4-split of multiplicity 1, and two distinct 5-splits of total multiplicity 3. A corresponding 
splits graph (c/. Chapter 4.4 in [I3j) is depicted in Figure [2] in which, for clarity of presentation, all 
seven 1-splits separating each one of the seven sequences Aqq, Aqq, Bqq, C02, -Dqi, Fq2, and I99 from the 
other 11 sequences are omitted, and edge lengths are chosen only to avoid ambiguous overlapping and 
do not indicate 'biological weight' or multiplicity. 

-^02 G02 



o o o 




C02, Eqi 



o o 



-B90 Hgs 

Figure 2: The splits grapli for the non-trivial splits in 'Ehiv- Here "•" stands for the three sequences Cg4,Doi and /gg. 

We then formed 24 distinct cluster systems: 12 by forming, for every sequence x £ X, the cluster 
system that contains all 'split halves' that contain x, and 12 that contain exactly all the complements 
of these split halves. We then computed their adjoints and their double adjoints. As was to be 
expectecQ the adjoints and the double adjoints computed for the first 12 cluster systems were rather 
trivial; the adjoints either consisted of CtriviX) or, in three cases, contained one additional cluster of 
cardinality 2, implying that the double adjoints either coincide with C{X) and, thus, have cardinality 
4049 = 2^^ — 1 or - again in those three cases - with a subset of C{X) of cardinality 2049. The adjoints 
and the double adjoints computed for the other 12 cluster systems are more interesting: Ten of the 
adjoints have cardinality 19, one adjoint associated with Eqi has cardinality 21 and the largest one 
associated with C94 has cardinality 25 while ten of the double adjoints have cardinality 262 and those 
associated with Eqi and C94 have cardinality 261. None of them is a hierarchy, and all but one of 
the hierarchies obtained as intersections of the adjoints and the double adjoints contain 16 clusters, 
that is, three more than there are clusters in Ctriv{X). Furthermore, either the cluster {Co2,-E'oi} or 
X — {C02, -Eqi} and either {C94, Dqi,Iqq} ot X — {C94, Dqi, Iqq} turn up invariably. 

So, while this does not provide too much information (as had to be expected in view of the fact that 
the split system Tihiv is not even weakly compatible), it is remarkable that our analysis does not only 
strongly support the cluster {Co2,-E'oi} that is also supported by all other methods we investigated, 
but also the cluster {C94, DqI; -^99} even though it is generally not supported by other methods and 
the corresponding split {C94, Dqi, 199} |X — {C94, Dqi, Igg} is not even one of the 16 splits induced by 
the 27 non-constant sites of the input alignment. 



^As usual (see, for example, [131 128j). we call a bipartition or split S = {A, B} of the set X into two non-empty 
disjoint subsets A,B of X to be a k-split if min(|A|, \B\) — k holds, and trivial if it is a 1-split. Two splits S and S' of 
X are called compatible if An A' — III holds for some A £ S and some A' £ S' . A collection of splits of X - or a split 
system - is called compatible if any two splits in that collection are compatible. And it is called weakly compatible if one 
of the four intersections AnA'n A", An(X ~ A')n{X - A"), {X - A) n A' n {X - A"), and (X-A)n{X~ A') n A" 
is empty for any three splits S, S' , S" in that collection and all A G S,A' G S' and A" G S" . 

''As even for a 'compatible split system' corresponding to a phylogenetic tree, there would be lots of overlap. 
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In a subsequent paper dealing with applications, we will discuss this and related phenomena and 
their possible biological significance in more detail. 

9. Concluding remarks 

(1) The observation that Co is a hierarchy for every cluster system C G C^'^\X) allows us to associate 
a canonical hierarchy to an arbitrary cluster system that can actually be further enlarged ~ but 
not canonically - by adding any arbitrary subset in C or in C* , or even by forming the union 
Co U C U C" where C is any arbitrarily chosen hierarchy in C** and C" is an arbitrarily chosen 
hierarchy in C* . This generalizes a construction described in [14J where we discussed the problem 
of relating, at least on a purely theoretical level, species trees to genealogical history of individual 
organisms. It was the starting point for the investigations presented here, and it might be of 
interest to relate the discussion in [H] and related discussions in [U |3l [26] to the constructions 
presented here. 

(2) Next, we briefly outline two possible phylogenetic applications of some of our results (for con- 
ciseness, we refer readers who are unfamiliar with some of the phylogenetic terminology to 
[HI EE]): 

For the first application, suppose we have a collection TZ of rooted phylogenetic trees having leaf 
sets that comprise subsets of X. We use the symbol 'a6|c', for any three distinct elements a,b,c G 
X, as a shorthand for the 'rooted triplet' formed by o, 6 'versus' c, i.e., the pair ({a, 6},{c}) of 
subsets of X consisting of the 2- subset {a,b} and the 1- subset {c} of X. Using this notation, 
consider the set TZ^ of 'rooted triplets' xy\z 'displayed' by the trees in TZ, i.e., all triplets xy\z 
for which some tree Tq £ TZ and some edge cq in Tq exist such that 

(i) x,y, and z are leaves of Tq and 

(ii) Co separates x and y from z and the root of Tq. 

Now, for real data, TZ3 will typically be 'incompatible' (i.e. no tree will display all the rooted 
triplets in 7^3). However, we might hope that TZ3 is 'sufficiently comprehensive', i.e., that some 
(unknown) subset TZ'^ of TZ3 'defines' some hopefully true species tree T with leaf set X, in the 
sense that T is the only tree that displays the rooted triplet trees in TZ'^. In this case, it is 
well known that T must be a 'fully resolved' tree (i.e., the 'clades' of T, i.e., the sets of leaves 
that can be separated from its root by some edge, must form a maximal X-hierarchy Ct)- In 
general, the collection of all clusters present in the trees in the original collection TZ may fail to 
contain the clusters of T. However, the weak patchwork closure provides a formal way to obtain 
a superset of Ct from TZ3 and thus, at least, captures all of its clades. More precisely we have 
the following result: 

Proposition 9.1. IfTZ^ contains a set of rooted triplets from X that defines a rooted phyloge- 
netic tree T on X, then Ct is a subset of [Cn^\wk where Cn^ := {{x,y} : 3z£xxy\z G TZ3}. In 
particular, the adjoint C^^ of C-ji^ must be a hierarchy that is contained in C^ = Ct, and its 
double adjoint must be ample. 

The proof of this result in the special case where TZ itself defines T uses induction on the height of 
T, together with the well-known result that, if TZ defines T, then each interior edge of T must be 
'distinguished' by at least one rooted triplet from TZ. The general case where TZ merely contains 
some subset TZ'^, of rooted triplets that defines T follows immediately, since [Ct^^J^/c C [CT^gj^yfe 
holds for every subset TZ'^ of TZ3. 

As a second possible application, suppose that (i) T is a rooted binary phylogenetic tree with 
leaf set X and (ii) T' is some 'perturbation' of this tree obtained by applying some subtree 
rearrangement operation to T (i.e., re-attaching some subtree of T to a different part of the 
tree, as described further in [28]). Let C be the union of the clusters of T and T' . Then, if the 
tree rearrangement corresponds to a 'nearest neighbour interchange' (NNI), one has [C]^fc = C. 
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However, if the rearrangement corresponds to moving one branch of T further across T than 
an NNI move ahows, the set [C]wk will, in general, be considerably larger than C. The former 
observation can be extended to allow more than one perturbation. In particular, if the NNI 
moves occur at 'well separated' nodes of T, the union of the clusters of T, along with the 
clusters of all the resulting perturbed trees will form a weak patchwork. In this way, weak 
patchwork closures may be a possible tool for helping to distinguishing local errors (due to lack 
of phylogenetic signal, or lineage sorting) from more extreme rearrangements events, as might 
occur with gene trees in settings where some lateral gene transfer events have occurred between 
distantly related taxa. 

(3) Assume that C is a weak and ample patchwork that contains Ctriv In view of Theorem [l| this 
implies that the hierarchy Co is contained in a maximal X-hierarchy that is contained in C. 
However, it is easy to see that Co need not be a maximal X-hierarchy itself, and that many 
distinct maximal X-hierarchies may contain Co. For example, C{X) is an ample patchwork, yet 
C{X)o consists of the trivial clusters only. So, it does not seem to be always possible to find a 
maximal X-hierarchy in an ample patchwork in a 'canonical' way. 

(4) It should also be of some interest to work out the corresponding theory for split systems rather 
than cluster systems, as well as for weighted cluster and split systems. For example, denoting 

(i) by ||C|| := |{C}*| = 2l^l + 2l^-^l+^ - 2, for any cluster C G C{X), the number of clusters 
C G C{X) that are compatible with C, 

(ii) by \ip\, for any map ip : C[X) — )• M, the ^i-norm Yl,ceC{x) V') 

(iii) and the set of all weighted cluster systems for X of ^i-norm 1 by 

Wi{X) := {ip G r'^P : \^\ = 1}, 

one could study the map 

Wi{X) Wi{X) -.ip^if* 
where 'P*{C) is defined, for any map ip G Wi(X) and any cluster C G C{X) by, say, 

EceC(x)\\CMCy 

a map that maps the (2" — 2)-dimensional simplex Wi{X) indeed into itself in view of 

cec(x) c'e{cy* c"gC(x) 

(5) Several interesting questions remain: 

- Is there another way to describe or characterize Co? In particular, can it be calculated in 
polynomial time in n and |C|? 

-Does An — 2 < \C*\ + \C**\ < 2" + nholdfor every non-empty cluster system C C C(X), i.e., are 
the cases where C is a maximal X-hierarchy and C = C{X) the 'extremal' cases when asking for 
the minimum and the maximum of |C*| + |C**| over all non-empty cluster systems C C C{X)? 

- As IJ^i. ext^'^*''(C) = [C]st is ample whenever C is an ample weak patchwork that contains Ctriv, 
does the same hold also for ext^^^\c), i.e., is ext^^^\c) ample whenever C is an ample weak 
patchwork that contains Ctriv'^ If yes, some rather detailed case-by-case considerations seem to 
be required to establish this fact. 

- Also, Proposition 7.2 tells us that [C]wk = ext^"^^j^(C) always holds, and it is natural to ask 



whether an analogous result holds for the other two patchwork-closure operators, too. In other 
words, can we find some less than exponentially growing integer function / = f{n) with [C]pt = 

ext^j^^^{C) or [C]st = 6xty|2)(C) cluster systems C C C{X) and, if so, how slowly can / 

grow as a function of n? 
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(6) In view of the investigations presented in [101, it may also be of interest to replace the binary 
relation '||' by any of the relaxed relations '||fc' {k £ N) defined by: 

^ll;.^ \AU B\ <m.ax{\A\,\B\) + k 

for all A,B £ C{X) and to study the associated 'duality theory'. 
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